A multiple hold-out framework for Sparse Partial Least Squares

Authors

  • João M. Monteiro
  • Anil Rao
  • John Shawe-Taylor
  • Janaina Mourão-Miranda
Abstract

BACKGROUND: Supervised classification machine learning algorithms may have limitations when studying brain diseases with heterogeneous populations, as the labels might be unreliable. More exploratory approaches, such as Sparse Partial Least Squares (SPLS), may provide insights into the brain's mechanisms by finding relationships between neuroimaging and clinical/demographic data. The identification of these relationships has the potential to improve the current understanding of disease mechanisms, refine clinical assessment tools, and stratify patients. SPLS finds multivariate associative effects in the data by computing pairs of sparse weight vectors, where each pair is used to remove its corresponding associative effect from the data by matrix deflation, before computing additional pairs.

NEW METHOD: We propose a novel SPLS framework which selects the adequate number of voxels and clinical variables to describe each associative effect, and tests their reliability by fitting the model to different splits of the data. As a proof of concept, the approach was applied to find associations between grey matter probability maps and individual items of the Mini-Mental State Examination (MMSE) in a clinical sample with various degrees of dementia.

RESULTS: The framework found two statistically significant associative effects between subsets of brain voxels and subsets of the questions/tasks.

COMPARISON WITH EXISTING METHODS: SPLS was compared with its non-sparse version (PLS). The use of projection deflation versus a classical PLS deflation was also tested in both PLS and SPLS.

CONCLUSIONS: SPLS outperformed PLS, finding statistically significant effects and providing higher correlation values in hold-out data. Moreover, projection deflation provided better results.
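The abstract does not spell out the algorithm, so the following is only a minimal sketch of the ideas it names: one pair of sparse weight vectors, projection deflation, and a correlation measured on hold-out data. It assumes a common formulation of SPLS (alternating soft-thresholded power iterations on the cross-covariance matrix) and runs on synthetic data. The function names, the `frac` sparsity parameter, and the single train/hold-out split are illustrative assumptions, not the authors' implementation, which selects sparsity levels and tests significance across multiple splits.

```python
import numpy as np


def sparse_unit(a, frac):
    """Soft-threshold a at frac * max|a|, then scale to unit L2 norm.

    `frac` in [0, 1) is a hypothetical sparsity knob chosen for this sketch.
    """
    lam = frac * np.max(np.abs(a))
    s = np.sign(a) * np.maximum(np.abs(a) - lam, 0.0)
    norm = np.linalg.norm(s)
    return s / norm if norm > 0 else s


def spls_pair(X, Y, frac_u, frac_v, n_iter=200, tol=1e-10):
    """Compute one pair of sparse weight vectors (u, v) for centred X and Y."""
    C = X.T @ Y  # cross-covariance matrix, up to a constant factor
    # Start from the dense PLS solution: first right singular vector of C.
    v = np.linalg.svd(C, full_matrices=False)[2][0]
    u = np.zeros(X.shape[1])
    for _ in range(n_iter):
        u_new = sparse_unit(C @ v, frac_u)
        v_new = sparse_unit(C.T @ u_new, frac_v)
        change = np.linalg.norm(u_new - u) + np.linalg.norm(v_new - v)
        u, v = u_new, v_new
        if change < tol:
            break
    return u, v


def projection_deflation(X, w):
    """Remove from X the component along the unit-norm weight vector w."""
    return X - np.outer(X @ w, w)


# Synthetic data: one latent effect shared by 5 X variables and 2 Y variables.
rng = np.random.default_rng(0)
n, p, q = 200, 50, 10
z = rng.standard_normal(n)
X = 0.5 * rng.standard_normal((n, p))
Y = 0.5 * rng.standard_normal((n, q))
X[:, :5] += z[:, None]
Y[:, :2] += z[:, None]

# One train/hold-out split; centring uses training means only.
train, hold = np.arange(150), np.arange(150, n)
mu_x, mu_y = X[train].mean(axis=0), Y[train].mean(axis=0)
X_train, Y_train = X[train] - mu_x, Y[train] - mu_y
X_hold, Y_hold = X[hold] - mu_x, Y[hold] - mu_y

u, v = spls_pair(X_train, Y_train, frac_u=0.5, frac_v=0.5)

# Correlation between the projections of unseen data onto the weight pair.
r_hold = np.corrcoef(X_hold @ u, Y_hold @ v)[0, 1]

# Deflate before searching for a second associative effect.
X_deflated = projection_deflation(X_train, u)
Y_deflated = projection_deflation(Y_train, v)

print("selected X variables:", np.flatnonzero(u))
print("selected Y variables:", np.flatnonzero(v))
print("hold-out correlation:", round(float(r_hold), 3))
```

A second pair would be obtained by calling `spls_pair` on `X_deflated` and `Y_deflated`. To judge whether a hold-out correlation is larger than chance, one option is to compare it against correlations obtained after permuting the rows of one data block, repeated over several random splits.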

Similar resources

Sparse partial least squares regression for simultaneous dimension reduction and variable selection

Partial least squares regression has been an alternative to ordinary least squares for handling multicollinearity in several areas of scientific research since the 1960s. It has recently gained much attention in the analysis of high dimensional genomic data. We show that known asymptotic consistency of the partial least squares estimator for a univariate response does not hold with the very lar...

Efficient Hold-Out for Subset of Regressors

Hold-out and cross-validation are among the most useful methods for model selection and performance assessment of machine learning algorithms. In this paper, we present a computationally efficient algorithm for calculating the hold-out performance for sparse regularized least-squares (RLS) in case the method is already trained with the whole training set. The computational complexity of perform...

Diagnosis and prognosis of osteoarthritis by texture analysis using sparse linear models

We present a texture analysis methodology that combines uncommitted machine-learning techniques and sparse feature transformation methods in a fully automatic framework. We compare the performances of a partial least squares (PLS) forward feature selection strategy to a hard threshold sparse PLS algorithm and a sparse linear discriminant model. The texture analysis framework was applied to diag...

Simultaneous Spectrophotometric Determination of Iron, Cobalt and Copper by Partial Least-Squares Calibration Method in Micellar Medium

Iron, cobalt and copper are metals that appear together in many real samples, both natural and artificial. Recently, a classical univariate micellar colorimetric method was developed for the determination of these metal ions. Organized molecular assemblies such as micelles are used in spectroscopic measurements because of their possible effects on the systems of interest. The ability of mi...

Journal title:

Volume 271, Issue

Pages -

Publication date: 2016